Quality Data, Quality Decisions: Why Web Scraping is Essential for Advanced Analytics
Gediminas Rickevičius·9 min

Source: KDNuggets
What is Machine Learning? Machine learning is the ability of computers to learn new things autonomously. The learning process is based on data, past experience, and observations. The more data the computer processes, the better it becomes in the conclusions it makes. And this is exactly why machine learning algorithms have become an integral part of the financial markets’ DNA. How is it Used? The trading process has evolved massively, to a state where traders employ sophisticated parameters and combinations of factors to come up with a decision. From social sentiment scores, through technical indicators, to fundamental information – investing today is more complicated than ever. Machine learning has the potential to ease the whole process by analyzing large chunks of data, spotting significant patterns and generating a single output that navigates traders towards a particular decision based on predicted asset prices. How Does it Work in Practice? In their core, financial markets tend to be unpredictable and even illogical, just like the outcome of the Brexit vote or the last US elections. Due to these characteristics, financial data should be deemed to possess a rather chaotic structure[1] which often makes it hard to find sustainable patterns. In order to solve this, the algorithm should be fed with as much unbiased information as possible.
Modeling chaotic structures requires machine learning algorithms capable of finding hidden laws within the data structure and predict how they will affect them in the future. The most efficient methodology to achieve this is “Deep Learning”. Deep learning can deal with complex structures easily and extract relationships that further increase the accuracy of the generated results. Here's a guide to building deep learning models to help you get a better understanding.
The way machine learning in stock trading works does not differ much from the approach human analysts usually employ. The first step is to organize the data set for the preferred instrument. It is then divided into two main groups – a training set and a test set. Why is that? Before the algorithm is tested, it needs to be trained and fine-tuned which is what the training set serves for. After it becomes clear that the algorithm fits all requirements, it is then put into action with the test set. After the algorithm generates a result, it is then compared to the real-life performance of the particular stock.
An Example of the Logic Behind a Machine Learning Algorithm for Stock Trading
There are plenty of ways to build a predictive algorithm. However, most of them usually follow the logic presented below as it is an easy and efficient way for basic stock market predictions:
| Signal Effect | ||||||
| Top 50 predicted | 3 days | 1 week | 2 weeks | 1 month | 3 months | A year |
| Top 20 signals | 0.21% | 0.35% | 0.67% | 1.35% | 4.04% | 19.25% |
| Top 20 signals | 0.25% | 0.37% | 0.69% | 1.21% | 4.06% | 19.63% |
| Top 10 signals | 0.22% | 0.38% | 0.81% | 1.33% | 4.58% | 21.95% |
| Top 5 signals | 0.24% | 0.52% | 1.19% | 1.67% | 4.96% | 22.78% |
| Benchmark | 0.13% | 0.23% | 0.50% | 1.07% | 3.09% | 14.63% |
| Outperformance | 60.46% | 51.94% | 34.49% | 26.03% | 30.71% | 31.61% |
Source: BusinessPundit
With an academic and professional background in the field of Finance and IT, Bingran has a deep appreciation for data-driven decision making. He is a fan of all things data related. Currently building his dream tool to help everyday people make smart investment decisions.